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Abstract 

Organisms adapt to fluctuating environments by regulating their dynamics, and by adjusting 
their phenotypes to environmental changes. We model population growth using multitype branching 
processes in random environments, where the offspring distribution of some organism having trait 
t 6 T in environment e £ £ is given by some (fixed) distribution Tt, e on BJ. Then, the phenotypes 
are attributed using a distribution (strategy) 7rt, e on the trait space T. We look for the optimal 
strategy irt,e, t G T , e £ £ maximizing the net growth rate or Lyapounov exponent, and characterize 
the set of optimal strategies. This is considered for various models of interest in biology: hereditary 
versus non-hereditary strategies and strategies involving or not involving a sensing mechanism. Our 
main results are obtained in the setting of non-hereditary strategies: thanks to a reduction to simple 
branching processes in random environment, we derive an exact expression for the net growth rate 
and a characterisation of optimal strategies. We also focus on typical genealogies, that is, we consider 
the problem of finding the typical lineage of a randomly chosen organism. 
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1 Introduction 

Organisms adapt to fluctuating environments by regulating their intrinsic dynamics, and adjust their 
phenotypes or traits to the random environment. Observations reveal that most cell populations are 
heterogeneous, that is, arc composed of various phenotypes. In [5], [TUJ and [12] basic models are 
provided to explain this heterogeneity: Time is continuous, and the related stochastic processes can be 
seen as birth processes for the different phenotypes with migration processes between them in fluctuating 
environments. We first present the models considered in [TIT], and then turn to our new setting. In [TU], 
the trait space T is finite, and the environment space £ is such that £ = T, the idea being that trait i has 
the fastest growth rate in environment e = i. The migration rate matrix = (H^)ij e -r gives the 

(k) 

switching rates from phenotype j to i in environment e — k. The changing environment is modeled 

as an alternating renewal process. The migration rates defines possible strategies to overcome the 
uncertain future. [10] considered two basic strategies: stochastic switching and responsive switching. In 
the first case, the organisms decide to switch the phenotypes independently of the running environment, 

(k) 

they do not use sensors, so that the migration rates can be written as H^- = ffy. Responsive switching 
assumes strong use of sensors, at a certain cost, and the extreme way of responding to environmental 
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changes is obtained by assuming migration rates of the form ifL = H m when j ^ fc, and = when 
i =/= k and j ^= i. In this last situation, organisms opt always for the most favourable phenotype in the 
running environment, that is for the trait having the largest growth rate. Denoting by Xi(t) the mean 
number of organisms having phenotype i at time t, the average X(t) is solution of the random differential 
equation dX(t)/dt — A^u\X{€), where Agns denotes the rate matrix composed of the various growth 
and migration rates in environment £ (t) at time t. The authors then estimate the related Lyapounov 
exponent ln(N(t))/t as t — > oo, where N(t) = YlieS -^»(*) * s the average total population size. The 
computations are performed under the assumption that the environment changes slowly, so that, the 
process has the time to attain an equilibrium during each phase, see (10) for more details. A similar 
model was also considered by [12] for two phenotypes and two environments. The model was studied using 
Monte-Carlo simulations, which indicate that responsive switching is sometimes not the optimal strategy. 
The average proportion of cells having the fastest growth rate is maximized for strategies using a sensing 
mechanism allowing migration to the unfavourable state, leading to heterogeneous populations, providing 
in this way a rationale for population diversification. These models were then studied analytically in [7]. 
Modifications of these models were also considered more recently in [14) where catastrophic events are 
modeled. 

In the present work, we treat similar problems using discrete time multitype branching processes in 
random environments, denoted by MBPRE in what follows, which offer alternatives to the above birth 
with migration processes. The advantage of multitype branching processes is that one can separate more 
clearly the birth and migration phases. We can for example treat in this way populations composed of 
organisms where birth occurs in a very precise period. When this is not the case, discrete time modeling 
still provides relevant informations for population growth. We will also consider the problem of finding 
optimal strategies to maximize the random Lyapounov exponent. Our mathematical approach also 
permits to treat situations where the trait and environment spaces T and £ are continuous: all of our 
results are valid in very general situations, like the models considered in [9]. To be useful for scientists 
not familiar with too advanced mathematics, we first illustrate some results and provide examples when 
both T and £ are finite. 

1.1 Results 

Assume that T = {t\, ■ ■ ■ , t q } and that £ = {ei, • • • , e p }. The process of interest (Z n ) n >o is written 
as a random vector Z n — (ZfytzTi where Z l n denotes the number of organisms having trait t € T at 
generation n. The transition between generations n and n+ 1 is modeled as a two step procedure. First, 
for given t G T, each of the i — 1, • • • ,Z\ l organisms having trait t gives birth to a random number 
of descendants given by a random variable where the index ui n G £ models the environmental 

state at generation n. In the second phase, each of the j = 1, ■ ■ • , new individuals is assigned to a 
random trait t*'^ € T. As usual in such processes, we assume that all of these random variables are 
independent, with i.i.d. £i'n" and i.i.d. r''^™. We further assume that the environmental process {uJ n )n>o 
is fixed, and is supposed to be the realization of a stationary and ergodic process d taking values in 
£. Let T tie and Tt t , e be distributions of the and T*j n for a trait t G T in environment e G £. We 
will often use the first moment mt, e of T t , e , giving the average number of descendants for an organism 
having trait t in environment e. Optimal strategies will be considered for fixed distributions T t , e , that 
is, we will maximize the population growth rate as a function of the distributions n t . e - Sections 3, 4 and 
5 consider extinction and optimal growth questions. Section 6 deals with typical genealogies, that is, we 
consider the question of finding the typical lineage of an individual chosen randomly at generation n. 
We also illustrate these notions both for finite spaces T and £ and for continuous ones. 

As stated in the previous Section, the authors in [10] distinguish between stochastic and responsive 
switching. In the first case, the distributions 7r tje on T depends on t but not on e, so that we can 
write TT tj e = T^t- Concerning the second family of strategies, n t ^ e depends on e but not on t. We will 
also distinguish between several natural situations, namely between strategies involving or not a sensing 
mechanism, and strategies ir tje depending or not depending on t, called hereditary and non-hereditary 
in what follows. We illustrate some results when both T and £ are finite, and for 

1 i.e. for all i £ N, (wo, wi, ...) is distributed as (wi, ui;+i, ...) 

2 i.e. for all bounded Borel function /, W(f(wo,Wi, ...) = f(wi, v)2j •••)) = 1 implies f(uiQ,Wi,...) is constant. 
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1. non-hereditary strategies with no sensing mechanism, where 7r t e = p, for some probability measure 
on T, 



2. and for non-hereditary strategies using a sensing mechanism, of the form Tr t , e = Pe, where p = 
(j>e)ee£ is a family of probability measures on T. This second family contains responsive switching 
strategies. 

We will describe the optimal set for non-hereditary strategies, that is, we will characterize the set of 
distributions (jtt,e) yielding the fastest growth rate. We assume that the random environment (cj„) n >o 
is a stationary process taking values in £ . 

1.1.1 The non-hereditary with no sensing mechanism case 
We first prove in Proposition [T] that 

lim -logE w [|Z n |] =y(p), 

n — >oo Ji 

where the Lyapounov exponent "f(p) is given by 

TO) = E[logm P)tl)0 ], 
and m Pj e is the first moment or average value of the mean distribution 

T p , e = J T t<e p(dt). 

Let V(T) be the set of probability measures on T and 7* = sup{7(p); p € V(T)} be the top Lyapounov 
exponent. We denote by V* the subset of V(T) containing the distributions p maximizing the Lyapounov 
exponent, i.e. 7* = j(p). We show in Proposition [3] that p G V* if and only if 



f ^ Vl (de) < 1, Vt €T. 

J E m p,e 



where v\ is the law of the stationary random environment. A strategy is called pure if it is concentrated 
on a single tgT, that is, takes the form p — 5 t - It then follows that a pure strategy p = St is optimal if 
and only if 

/ U*£^(de) < 1. Vi'eT. 

h m t.e 

When T = {t\, • • • , t q } and £ — {ei, • • • , e p }, with q > p, we also prove that there is a unique maximizcr 
in V* which is supported by a set containing at most p elements. 

As a further illustration, we consider the simplest case when p — q = 2. 

1. p* =6 ta and 7* = ^( ei )log(m t2 , ei ) + n(e2)log(m t2 , e2 ) when ^gf^ + < 1, 

2. p* = <5 tl and 7* = ^(eOlogK^J + ^(e 2 ) log(m tl , e2 ) when + ^t^ 2 " 2 < 1. 

3. Otherwise p* is given by 

* = / ^i(ei)m t2 , e2 ^i(e 2 )m t2 , ei \ ^ / ^i(ei)m tl , e2 vi(e 2 )m tuei \ § 

\ m t 2 ,e 2 - m t u e 2 mt 2 , ei - fnt uei ) 1 \m tu e 2 ~ m t2: e 2 "^ti.ei ~ ^t 2 ,ei / 2 

with 

7* = log|m tl!ei m t2!e2 - m tl , e2 m t2)ei | - ^i(er) log |m t2 , ea - ^ti,e 2 | - ^i(e 2 ) log |m t2 , ei - m tl , ei | 
+^i(ei)log(z/i(ei)) + ^i(e 2 )log(^i(e 2 )). 



3 



As a numerical example, consider the case ^i(ei) = ^i(e 2 ) — 1/2 and 

m tl ,e 1 = 1-5, TOt 3 ,ei = 0.6, m tu e 2 = 0.6, m t2i e 2 = 1.5. 

The two environments are equiprobable, with type t\ better fitted to environment e%, and a symmetry 
in the model. In this case, the Lyapounov exponent for the population without mutation is given by 

7(<M = liM = 0.5(log(1.5) + log(0.6)) « -0.053. 

This implies that both types are subcritical and that the corresponding homogeneous populations extinct 
almost surely. The optimal strategy is then p* = (0.5, 0.5) and the corresponding Lyapounov exponent 
is 

7* = log(0.5 * (1.5 + 0.6)) w 0.049. 

With this optimal strategy, 7* > and the population explodes with positive probability almost surely. 
This is one of the simplest example when polymorphism is a necessary condition for survival (see section 
[331). 



1.1.2 The non-hereditary with sensing mechanism case 

Assume that n t . e — p e for some strategy p = (p e ) e& £. It turns out that the relevant piece of environment 
is given by the pair process u/ 2 ' = ((w„_x, w n ))n>i € £ 2 , of stationary measure i/ 2 (dei, de 2 ). Consider 
the average distribution 

of expected value 

m M2 p ei (dt). 



When the following integral exists 

j(p)= log(m p ea )i/ 2 (dei,de 2 ), 
Js 2 

we prove in Proposition [7] that 

lim rT 1 logE w [|Z n |] = 7(f)) almost surely. 



Let 7** be the optimal growth rate, that is, the supremum of j{p) among all possible families p, and 
denote by V** the related set of optimal strategies. Let ^ e i(de 2 ) be the conditional distribution of w 2 
given that u>x = e%. We prove in Proposition [5] that p is optimal if and only if 



f J^h^ v ei (de 2 )<l, VteT, 



^i(dei) a.s. An interesting consequence is that there is no gain to be expected using a sensing mechanism 
when the random environment has some independence property: If u>\ and u>2 are independent, then 
7* = 7**. The information one can gather on the present environmental state using sensors does not 
help when dealing with future events. 

Next, we develop further the simplest case when £ = {ei,e2} and T = {ii,^}- We suppose that the 
distribution for the environment v is a Markov chain with transitions 

v(u) n+ i = ei\uj n = e\) = 1 - qi v(u n +i = e 2 \uJ n = e x ) = qi 

u(uj n+1 = ei\uj n = e 2 ) = q-2 v{Lo n+ i = e 2 \uj n = e 2 ) = 1 - q% 



where qi € (0, 1) denotes the probability that the environment switches in the next step when it is 
currently in state e^. The sequence u> is then ergodic and stationary distribution 

v\ = ■ o ei H ; d e2 . 

qi +92 qi + 92 
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We suppose furthermore that the Markov chain is started from the steady state, i.e. ujq has distribution 
v\ , so that the sequence u> is stationary. Then, the conditional distributions of u\ given loq is precisely 
given by the transition of the Markov chain. The optimal strategy with sensing is then such that 

1. p** = S t2 when (1 ~ gl)mt1 ^ + < 1, 

^ei 12 mta.ej ™t 2 ,e 2 _ 

2. p* = o t , when - ; — H ^-^ < 1, 

3. and otherwise, 

( (1 - qi) m t 2 ,e 2 ?l"Ha,ei \ r / (1 - qi)m tl .e 2 , «im tl , ei \ 

P ei = 1 Ot! + 1" <>t2) 

\mt2,e 2 - m t u e 2 m t 2 ,e x ~ m t u e x J V m ti ,e 2 ~ m t 2 ,ea m t x ,e x ~ mt 2 ,e x J 

with similar equations for p**. Rather than giving a general formula, we consider the numerical case 
when gi = (72 = 9 so that the stationary distribution is given by fi(ei) = fi(e2) — 1/2 and we use the 
same values for the number of offspring as in Section ["1.1. 11 that is we set 

mt 1( ei = 1.5, m t2 , ei = 0.6, m il)62 = 0.6, m t2ie2 = 1.5. 

Using the above results, we obtain that the optimal strategy p** is given by: 

= and whe, s <? 

5 

P*e* = $t 2 and p* e * = S tl when 9 > - 

** 5-7g ^Jq-2 „ 7g-2 ^ 5-7q 2 5 

Pei = — 3 — o tl + — - — 6 t2 and p e2 = — - — 6 tl + — - — d t2 if - < q < -. 



We sec three different environmental regimes, corresponding to low, intermediate and high frequency 
switching rates. When the environment fluctuates slowly, with q < |, the optimal strategy is pure and 
corresponds to what [10] called responsive switching. In the intermediate regime, the optimal strategy 
is a mixture of pure strategies. In the high frequency regime where environment fluctuates quickly 
(q > 5/7), the optimal strategy is a pure one, where organisms being in the favorable state are pushed 
to the unfavorable state. 

We deduce the optimal growth rate 

f log§- 9 logf if q< f 

7** = <^ qlogq+(l-q)log(l-q)+log% if f < q < f . 
[ logf -glogf if g>f 

When q = 1/2, the sequence u> is an independent sequence, so that 7**(l/2) = 7* (1/2) « 0.049, the 
optimal strategy is a strategy without sensing. When q « 1 (resp. gsO), the environment in the next 
step is very likely to stay the same (resp. to switch), so that we can determine with high probability 
which type will be fitted in the next generation. We observe indeed that 

3 

lim7**(g) = lirri7**(g) =log- « 0.176. 

<? — >0 q^l 2 

2 Definition of the Multitype Branching Process in Random 
Environment and of optimal strategies 

First, we recall that uj n represents the environment at time n, Z n the trait-structured population, Z l n the 
number of individuals with trait t and \Z n \ the total number of individuals at time n. Moreover 
gives the size of the offspring of the i-th individual with trait t at time n in environment u n , and Tj'j^j 
gives the trait of the j-th descendant of this individual. 
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Let us give now the formal definition of the process. 
Let (fi,^ 7 , P) be a probability space and T be a metric space and £ be a Polish space. 
For each pair (i, e) e T x £, let T t . e and n t: e be distributions on N and T respectively and suppose that 
T tie has a finite first moment denoted by m tje . 

Let w = (w„) n >o be a ^-valued stationary ergodic random process with distribution v on £ 

Denoting by N the set of almost null N-valued functions on T and (l t ) t6 r the canonical basis, the 

stochastic process Z n = (Z* : t e T) with values in N can be defined as: 

Z = N , 

tGT i=l j=l 

where 

• iV is a N -valued random variable giving the population at time 0, 

• {£,i,n, i > 1, n > 0} and {t^ Jjrl ; i > 1, j > 1, n > 0} are infinite arrays of iid random variables with 
values in N Tx£ and T Tx£ respectively and common distribution 

T = <g> Tf. e , TT = ® TTt.e 

(t,e)eT x£ (t,e)eT x£ 

respectively, 

• iVp, {^i,n;* > 1)« > 0}, {Tij iTl ;i > l,j i > l,n > 0} and o> = (w n )„>o are independent. 



Thus T corresponds to the offspring distribution and 7r to the distribution of trait of the offspring. 
Both distributions a priori depend on the trait of the parent and on the environment. 
We put \Z n \ = X^teT n > (an empty sum is equal to zero). 

Our attention will be focused on the role of the trait distribution it and how it affects the evolution 
of the population. The offspring distribution T will hence be considered as given and fixed, whereas 
the trait distribution tt will be considered as a parameter. The intuitive idea is that i) the local fitness 
of an individual is determined by its trait and its environment and corresponds to its mean number 
of descendants, ii) the intergenerational variability of the traits has to be tuned so as to maximize the 
global fitness of the whole population. 

The trait distribution tt can be seen as the strategy used by the population to maximize its growth. 
We measure the performance of the strategy n by the long term growth of the corresponding population 
(Z n ) n >o- let 

7(tt) = lim n^logE^jv,, [\Z n \] ■ 

n — >oo 

Here E^jv denotes conditional expectation given the environment and the initial population. As will 
be precised in the sequel, fairly general assumptions ensures that this limit exists and does not depend 
on the environment and initial population. Note that it is important here to consider the quenched 
model (i.e. conditionally on the environment) and note the averaged one (i.e. averaged on all possible 
environments): the criteria 7(71") measures if the population behaves well in a typical environment. An 
averaged criteria would be biased by unlikely environments where the population grows unusually faster. 

Several mechanisms can induce the intergenerational variability of the traits and leads to different 
assumption on the trait distribution ir. 



3 The non-hereditary case with no sensing mechanism 

According to the approach set out in the previous paragraph, we begin by studying the simplest case 
of non-hereditary traits in absence of sensing mechanism, meaning that the trait distribution of the 
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offspring depends neither on the trait of the parent nor on the environment. We thus suppose that Z n 
evolves according to model (TT]) with nt, e = P f° r some distribution p on T, and let n = ir(p) be the 
corresponding product trait distribution. 

3.1 Reduction to a simple BPRE 

In some sense, the non-hereditary assumption makes the structure of the population trivial: the trait 
distribution is given by p whatever the past evolution of the process was. In mathematical terms, we 
take advantage of some stochastic independence: 

Lemma 1. For any n > 1, the population structure is conditionally independent of the past population 
process given the size of the population, i. e. 



\%n\ J_J (Zq, ■ ■ ■ , Z n -i) \Z n 



Proof. It is easily seen from the assumptions on the model (fTJ) and from the non-hereditary assump- 
tion 7r = 7r(p) that the distribution of Z n given (Zq, ■ ■ ■ i? n _i) and \Z n \ is equal to the distribution 
of X)i<i<|z | with Ti an iid sequence with distribution p. This distribution does not depend on 
(Zo, ■ ■ ■ , Z„_i), this proves the conditional independence. □ 



It is worth noting that the result also holds for the quenched model (i.e. conditionally on the 
environment lo). The Lemma implies that the distribution of the population process (Z n ) n >i is easily 
recovered from the size process (|^ n |)n>i- This latter process turns out to be a simple BPRE and this 
allows us to compute the performance j(p) of the strategy p. 

Proposition 1. The size process \\Zn\)7i 

>i is a simple BPRE with offspring distribution in environment 

e given by 

T p , e = J T t , e p(dt), e e £. 
Conditionaly to lo, the expected population size at time n is 



E u [\Z n \]=E U0 [\Z 1 \]]l 



i 

k=l 



with m Pj e the first moment ofT p ^ e . If~f(p) — E [logm p . Wo ] exists, then 

lim n^ 1 logE w [|Z„|] — j(p) almost surely. 

n — >oo 

Proof. According to Lemma [TJ given (|Zi|, • • • , |Z„|), the population Z n has the same distribution as 
^2i<i<\z | 1t» with Ti an iid sequence with distribution p. Intuitively, the i-th individual has type Ti 
chosen randomly on T with distribution p. The size of the next generation \Z n+ i\ is then X)i<i<|z | £n'i " 
where Cnf" is the offspring of the i-th individual of type Ti in environment e n . From this two step 
procedure i) random choice of the type t, ii) reproduction with random offspring in environment e, 
we obtain the effective offspring distribution T p e in environment e: it is the mixture of the offspring 
distributions T t . e , with mixing distribution p. This proves the branching property for (\Z n \) n >i. The 
other properties follow: the branching property implies the recursive formula 

E w [|Z„+i|] = E (wor ..^ n _ l) [|Z„|]m p ^ ii 

and we get the formula for E W [|Z„|] follows. Taking the logarithm, we have 



n-1 

1 logE w [|Z„|] = n- 1 logE^flZil] + n- 1 logm P;t< 

fc=i 



which converges to y(p) almost surely according to the ergodic theorem and the integrability assump- 
tions. □ 
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For the sake of simplicity, we suppose in what follows that the initial population consists in a single 
individual with random trait with distribution p. The whole process (|Z n |)n>o is then a simple BPRE 
with offspring distribution T p and initial condition \Z n \ = 1. Simple branching processes in random 
environment have been introduced by Smith- Wilkinson [11] and Athreya-Karlin [3[ |3] and have been 
studied rather intensively since then [TJ1 [HI H] ■ We recall here some important results concerning the 
asymptotic behaviour of such processes (see e.g. the classification Theorem in [12): it states conditions 
under which either the population becomes extinct or explodes at geometric rate. 

To avoid the trivial case of a constant population, we suppose that T P)Wo is not almost surely equal 
to Si. We suppose also that j(p) = E[logm PjWo ] exists and is finite. We say that extinction occurs if 
Z„ — ► as n — ► co (in which case the sequence vanishes eventually), and that the population survives 
otherwise. Let q(co) — P w [Z n — > 0] be the probability of extinction given the environment. 

Theorem 1. (classification theorem) 

1. In both subcritical case "f(p) < and critical case "f(p) — 0, the population becomes extinct almost 
surely, i.e. P [q(uj) = 1] = 1 

2. In the supercritical case j(p) > 0, if furthermore E [— log (1 — T p . Wo ({0}))] < oo, then the popula- 
tion can survive with positive probability in almost every environment, i.e. P [q(uj) < 1] = 1. 
Furthermore, conditionally on nonextinction, the population explodes at a geometric rate: 



3.2 Optimal strategies 

We now focus on optimal strategies: i.e. what choice of the distribution p allows for the fastest growth 
of the population ? As discussed in Section 2, the performance of the strategy ir ~ n(p) is measured 
by the Lyapounov exponent "f(p). In the non- hereditary case with no sensing, an explicit formula for 
7(p) has been derived in Proposition [TJ The question naturally arise to determine the supremum of 
7(p) for p varying in V(T) the space of distribution on T and the set of optimal strategies p* in the 
case when this supremum is reached. Let 7* = sup{7(p); p 6 V(T)} be the optimal growth rate and 
■p* = |p* g y(T); 7(p*) = 7*} the set of optimal strategies. A strategy p is called pure if p = <5 t for 
some t € T (i.e. all individual in the population have the same trait t) or mixed otherwise. One ask 
further whether optimal strategies are pure or mixed. 

We now precise conditions under which existence of optimal strategies are ensured. We suppose that: 

(Ci) there is some M > such that mt je < M for all (t, e) S T x £, 

(C2) for all e G £, the application 1 1— > mt,e is continuous on T, 

(C3) for all e € £ and e > 0, there is a compact set K C T such that m t , e < £ for all t € T\ K 

Condition (Ci) is rather relevant from the biological point of view since an individual could hardly have 
arbitrary high number of offsprings in a fixed amount of time. Conditions (C2) and (C3) are related with 
the topology of T: individuals with close traits are supposed to have approximately the same behaviour, 
and traits close to infinity are supposed to have a poor fitness. 

Proposition 2. Under conditions (Ci) — (C3), optimal strategies exists and form a closed convex set 
i.e. the set V* is nonempty closed and convex in V(T) endowed with the topology of weak convergence. 
If furthermore the family M — {e 1— * m t , e .t € T} C L°°(£, v\) is linearly independent, then the optimal 
strategy is unique. 

Proof. First, we prove that the map 7 : V(T) — > M defined by 



is concave and upper semi-continuous on V(T) with respect to the topology of weak convergence. Let 
p n converging weakly to p in V(T). Using conditions (Ci) and (C2), we see that e - * m-p.e for all 



lim n 1 \ogZ n — "f(p) almost surely on {Vn > 0, Z n > 0}. 
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e E £. Define / n (e) = log(M) — log (m Pnte ) be non-negative measurable functions on £ and apply Fatou's 
Lemma: we get 

J limrnf f n {e)vi(de) < liminf J f n (e)vi(de). 

Equivalently, 

limsup J log(m Pn , e ) vi(de) < J log (m p>e ) vx{de). 

Hence limsup7(p„) < ^(p) and this states the upper semicontinuity of the application 7. The concavity 
of the application 7 on V(T) is a direct consequence from the concavity of the logarithm and the linearity 
of integration: first for p\,p 2 € V(T),X E [0, 1] and e E £, we have 

log (m Api+ (i-A)p 2 ,e) = log(Am Pl! e + (1 - A)m P2:e ) > Alog(m Pl , e ) + (1 - A)log(m P2i£ ) . 

This implies j(Xpi + (1 - X)p 2 ) > Xj(pi) + (1 - X) 1 {p 2 ). 

We then prove that V* is nonempty closed and convex. The closeness and convexity properties are 
straightforward since V* can be seen as the level set {p £ V(T);j(p) > p*} of the concave uppersemi- 
continuous application 7. It remains to check non-emptiness and we use compactness arguments. In the 
case when T is a compact space, then V(T) is also compact with respect to the weak topology and the 
upper semicontinuous map 7 reaches its maximum on V(T) so that V* is non-empty. In the case when 
T is non-compact, we consider its compactification T = T U {00}. We extend the definition of m by 
™oo,e = for all e € £. Condition (C3) ensures that this extension is continuous on T. Then the map 
7 extends on the compact space V(T), is uppersemicontinuous and hence reaches its maximum at some 
point p* £ V(T). It remains to check that p*({oo}) — so that p* can be seen as an element of V(T). 
It is straightforward since p*{{oo\) > would imply 7(p*) = — 00. 

At last, we prove the uniqueness in the case when the family M. is linearly independent. Let p\ and 
P2 be two points where 7 reaches its maximum. Using the strict concavity of the logarithm, we see that 
necessarily m p * tC — m p *. e ^i(e?e)-almost everywhere. Using the linear independence, this in turn implies 

P*=Pt ' " □ 

The following characterization of optimal strategies can be useful: 
Proposition 3. A strategy p is optimal if and only if 



I — Vl {de) < 1, Vt ET. 

h m P,e 



Proof. The strategy p is optimal if and only if 

7((1 - e)p + ep') - j(p) < for all p' and < e < 1. 
Using concavity and differentiability, this is equivalent to 

d 



de 

which can be rewritten as 

d 
d7 



h(p + s( P ' -p)l o <0, 



log(m Pi e + e(TOp', e - m Pl e)) v\{de) 
Thus a necessary and sufficient condition is 



|e=0 J£ " l P>e 



J£ m p,e 



It is easily seen that it is equivalent to test the condition for p' ~ S t , t E T and this proves the announced 
result. □ 

As a direct application of Proposition [31 we can answer the question wether there is a pure optimal 
strategy: 
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Corollary 1. The pure strategy p = St is optimal if and only if 

/ —L*. Vl (de) < 1 , Vt e T. 

h m t,e 

This gives us a simple criterion for the existence of optimal pure strategies. 

3.3 Extinction 

According to Theorem [TJ on the one hand the population becomes extinct in the case when 7(p) < 
and on the other hand (under additional technical conditions) the population survives and explodes with 
positive probability when ^(p) > 0. For the population's survival, strategies p with "f(p) > are of 
highest importance. This leads to the definition of the set S of strategies that allow the population to 
survive: 

S = {peV(T); 7 (p) >0}. 

The following property holds: 

Proposition 4. If j* < 0, the set S is empty. If j* > 0, S is a nonempty convex set containing V* . 

Proof. From the definition of 7*, j(p) < 7* for all strategy p. Hence S is empty if 7* < 0. If 7* > 0, 
then every optimal strategy p* € V* satisfies y(p*) — 7* > and hence V* C S. The map p 1— > 7(p) is 
concave function so that the level set 5 = {p£ 'P(T); 7(p) > 0} is convex. □ 

In some cases, a striking phenomenon may happen: no pure strategy can allow for survival, i.e. every 
homogeneous population with a single trait t suffers from extinction; but some mixed strategies may 
prevent from extinction, i.e. some polymorphic populations may survive forever. In this case, we should 
say that polymorphism is a necessary condition for survival. This phenomenon occurs when j(5t) < 
for all t 6 T whereas 7* > 0. 

The intuitive idea is the following: extinction occurs when the environment is bad for almost all the 
individuals in the population and hence a diversification of the traits in the population should imply a 
smaller number of environments that are bad for almost all individuals. This can be seen as a consequence 
of the concavity property: suppose that for all trait t, 7(^4) = 7, i-e. homogeneous population have the 
same growth rate no matter the trait t. Then for any strategy p, j(p) > f T ~f(t)p(dt) — 7, i.e. any 
polymorphic population has a better growth rate than any homogeneous population. See the examples 
below for further illustration of this phenomenon. 

3.4 Example: finite-dimensional case 

We consider the case when the vectorial space spanned by the family M. = {e 1— ► m t , e , t £ T} in L°°(£, V\) 
has finite dimension denoted by d. This occurs in particular as soon as: 

- cither the environment space is finite: £ — {e\, • • • , e p } and v\ is a discrete measure such that ^i(ej) > 
for alH € {1, • • • ,p}. In this case L°°(£, v\) is of dimension p and d < p; 

- or the trait is finite: T — {ti, ■ ■ ■ ,t q }. Then d < q and conditions (Ci) — (C3) are automatically 
satisfied. 

For convenience, we require furthermore that the following conditions holds: 

(H) for any pairwise distinct ti, ■ ■ ■ € T, the family of functions {e 1— * m ti-e ; 1 < i < d} is linearly 
independent in L°°(£,^i). 

This property will be convenient because it ensures the uniqueness of the optimal strategy. More precisely, 

Proposition 5. Suppose that span.M has dimension d and that conditions (Ci) — (C3) and (H) hold. 
Then: 

• there exists a unique optimal strategy p* S V(T), 

• p* is a discrete probability on T supported by at most d different types. 
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Proof. Existence of an optimal strategy is a consequence of Proposition^] However it is worth noting that 
in this finite-dimensional case, it is a consequence of standard analysis. We denote by Conv(Al) the closed 
convex hull of M. in L°°(£, v\) which can be seen as a closed convex set in a rf-dimensional vector space. 
Note that Conv(A^) is equal to the set of functions {m p : e i— > m P:e \ p € "P(T)} where to PiC is defined in 
equation J2]). Introduce the application 9 : Conv(A^) — > [—00, 00) defined by 0(m) = J £ \og(m(e))vi{de) 
(with the convention that log(u) = —00 if u < 0). With these notations, we easily see that ^(p) — 0(m p ) 
and maximizing 7 on V(T) is equivalent to maximizing 8 on Conv(.M). Next we observe that the func- 
tion G is uppersemicontinuous and strictly concave, so that it reaches its maximum at a unique point m* 
on the compact set conv(.M ). Furthermore the gradient of O never vanishes so that the extremum must 
be reached on a boundary point of Conv(.M) and m* € 9Conv(.M). The boundary of Conv(A^) consists 
in extremal types (type t such that m t : e 1— > m tje is an extremal point of the convex set Conv(A'J)) and 
of A:-dimensional faces determined by the convex hull of k + 1 extremal types, with < k < n. If m* 
belongs to such a /c-dimensional face, then m* = m p * where p* is a discrete probability on T supported 
by the k + 1 corresponding extremal types, with k + 1 < d. The uniqueness property comes then from 
assumption (H). Indeed, under this assumption, a k dimensional face determined by fc+1 extremal types 
does not contain any other point of M. so that the decomposition in barycentric coordinates are unique. □ 

It is worth noting that the application has always a unique minimizer m* in the above proof. Condition 
(H) ensures that there is a unique strategy p* such that m p - = m* . In absence of this condition, there 
might be several mixing distribution p such that m p = m* and then several optimal strategies. 

An appealing particular case is the case when both £ — {ei, • • • , e p } and T = \t\, • • • , t q } are finite 
with q > p. In this case L°°(£, V\) is of dimension d — p, and a generic configuration will always satisfy 
assumption (H) (in the sense that an exact linear relation between the types is very unlikely from a 
biological point of view). Then, according to Proposition [5l there exists a unique optimal strategy mix- 
ing at most p different types. This shows that the number of types involved in the optimal strategy is 
less than the number of different environments. This is reminiscent from a rule in ecology stating that 
the number of species in an ecosystem is bounded above by the numbers of different niche in the sense 
that two species cannot occupy the same niche for a long time (competitive exclusion principle). From 
a practical point of view, in this finite settings, the optimal strategy can be computed using standard 
numerical convex optimization (see [6] for instance). For more illustrations in this setting, see Section 

nxu 



3.5 Example: Gaussian distributions 

This example is due to Haccou and Iwasa [5]: the environment space is the set of real numbers £ = M, 
the environment u> is supposed to be a Gaussian stationary ergodic sequence with stationary distribution 
v\ = AT([J,, tr|) the Gaussian distribution with mean (igK and variance u\ > 0. The dynamic for the 
environment is irrelevant in the no-sensing case. The trait space is the set of real numbers T = IR and 
the mean offspring number of an individual of trait t in environment e has the gaussian form 

C ( (t-ef 

m t , e = -. y exp 1 



for some parameters C > and o~\ > 0. In environment e, individuals with trait value t = e are the 
best fitted. Note that conditions (Ci) — (C3) are fulfilled so that Proposition [5] holds: optimal strategies 
exist. Furthermore, the family A4 = {e *—>■ m t , e \t G T} is linearly independent so that unicity hold: 
there is a unique optimal strategy p* depending a priori on /i, a\, 02 and C. 
First we easily compute the fitness of a pure strategy 



l(S t ) = J log(m tje )i/i(de) = logC - ilog(27rCT?) 



(fx - tf + o-l 



2a{ 

and the optimal pure strategy is equal to 5^. Then according to Corollary [TJ we test if this pure strategy 
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is optimal in the set of all mixed strategies. For this we compute 



L 



— '—ui{de) = / exp — -2 vi{de) = exp 1 



2o( J V M 



and check that it is less than one when 02 < o\ ■ Hence if the fluctuations of the environment are small 
(er 2 < <xi), then the pure strategy <5 M is optimal. 

Gaussian strategies p = J\f(n P ,Op) yield easy computations: recalling that a Gaussian mixture of 
Gaussian distributions is a again Gaussian, we compute 

f ,,s C ( (u. 

rrip^e = / m t , e p{dt) = exp - 2 , 2^ 



2ir(al+al) V 2(<7? + of) 



and then 

7 (p)= / log(m t , e )^(d e )=logC-ilog(27r(«7f +(7=)) ^ 



We find that "f(p) is maximal (among Gaussian strategies) for [i v = /j, and 



a 2 



if o-l < o\ 

■ ~ \ of - o\ if of > of 



We have seen that this is the optimal strategy in the case when erf < o\. This is still the case when 
erf > o\ : we compute indeed for t S T and p* = 7V(£t, erf — of) 



and this characterizes the optimal strategy according to Proposition [31 
We have proven so far that: 

Proposition 6. i/ie optimal strategy is 



P AA( M ,a 2 2 -(7f) if of > of ' W 



and £/ie optimal growth rate 



f logC - I logger?) - ^ if af < cr? 
\ logC-ilog(2wf)-i if erf >cr? 

An interesting quantity is the relative gain of the best mixed strategy over the best pure strategy: 
it gives an indication of the strength of the selection pressure on mixed as opposed to pure strategies. 
From the above computation, denoting by x = °i /°i> we get 



lip*) - l(dfi) = 



if x < 1 

|(x-l-logx) ifx>l 



This is the log-ratio of the expected long-term growth rates of individuals playing the different types of 
strategies. It is non-decreasing with respect to x : when the environmental variance erf is large compared 
to of, there is a strong advantage in playing a mixed strategy. 

Next we illustrate the phenomenon discussed in section f3 .31 when polymorphism is a necessary condi- 
tion for survival. This happens when the optimal pure strategy <5 M leads to almost sure extinction of the 
population (7(<5 M ) < 0) whereas the optimal mixed strategy p* allows the population to survive (7* > 0). 
This phenomenon occurs when er 2 > o\ for every C belonging to the following non empty interval: 

~ l0g(27T0f ) + i < lQgC < i log(27TO?) + ||. 
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4 The non- hereditary case with sensing mechanism 



We now study the case of non-hereditary traits when some sensing-mechanism is available: we mean 
that the trait distribution of the offspring does not depends on the trait of the parent, but does depend 
on the environment because the individuals get some information about the environment they evolve in 
and are able to adapt suitably the trait distribution of their offspring. We thus suppose in this section 
that Z n evolves according to model ((T|) with 7r t e = p e for some family p — {p e )e£S of distributions on T 
and let 7r = n(p) be the corresponding product distribution. 



4.1 Reduction to a simple BPRE 

In a similar way as in the non-hereditary case with no sensing, the non-hereditary assumption makes 
the structure of the population very simple: the trait distribution at time n is given by the strategy the 
parents followed at time n — 1 in environment w n _i, that is J? w „_i ■ This is the expressed in the following 
Lemma: 

Lemma 2. For any n > 1, the population structure is conditionally independent of the past population 
process given the size of the population and the environment, i.e. 

(\Z n \,ui n -x) [J (Z ,--- ,Z„_i) (\Z n \,uj n -x). 

Proof. It is easily seen from the assumptions on the model ([I]) and from the non-hereditary assumption 
7r = 7r(p) that the distribution of Z n given (Zq, ■ ■ ■ Z n -\), \Z n \ and w n -i is equal to the distribution 
Of Ei=l It* With Tj an iid sequence with distribution p tl i n _ 1 . This distribution does not depend on 
(Zq, ■ • • , Z„_i), this proves the conditional independence. □ 



The Lemma implies that given the environmental sequence u>, the distribution of the population 
process (Z n ) n >i is easily recovered from the size process (|^n|)n>i- Once again, this latter process turns 
out to be a simple BPRE and this allows us to compute the performance j(p) of the strategy p. Let 
a/ 2 -* = ((w n _i, u>n))n>i denote the pair-environment with values in £ 2 . It is also stationary and ergodic 
and we denote by v 2 its stationary distribution, which is the distribution of the pair (wi,^)- 

Proposition 7. The size process {\Z n \) n >x is a simple branching process in environment a/ 2 ) with 
offspring distribution 

T P,(ei,e 2 ) = J^t,e 2 Pe 1 (dt). 

Conditionaly to to, the expected population size at time n is 

fe=i 

with m Pq , E2 = J T mt.e 2 Pe 1 {dt) the first moment of Tp .( ei ,e 2 )- Suppose the following integral exists, 

lip) = \ log (m p e 2 ) V2(de 1 ,de 2 ), 



n-l 

fc=i 



then 

lim n^ 1 logE w [|Z Jl |] — j(p) almost surely. 

n — >oo 

Proof. According to Lemma [5J given (\Z%\, ■ ■ ■ ,\Z n \) and w n _i, the population Z n has the same distri- 
bution as ^-n with n an iid sequence with distribution p Un _ t . Intuitively, the i-th individual has 

type Ti chosen randomly on T with distribution p UJn _ 1 . The size of the next generation \Z n+ \\ is then 

Si=i ^nf™ where is the offspring of the i-th individual of type n in environment tu n . From this 

two step procedure i) random choice of the traits t according to p u , n _ 1 , ii) reproduction with random 
offspring in environment ui n , we obtain the effective offspring distribution , UJn in environment 
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(2) 

LOn = (ci) n _i,w n ): it is the mixture of the offspring distributions Tt itl)n , with mixing distribution p UJn _ 1 ■ 
Other properties are proved as in Proposition [1] □ 

Note that the classification Theorem Q] applies in this case as well and gives criteria for extinction 
or explosion of the population. 

4.2 Optimal strategies 

We now describe the set of optimal strategies when sensing is allowed. Is is also interesting to evaluate the 
gain between optimal strategy with or without sensing mechanisms. Optimality when sensing mechanism 
are allowed will be denoted with a double asterix whereas we keep a simple asterix for optimality without 
sensing. Let 7** = sup{7(p); p G V(T) £ } be the optimal growth rate when sensing is allowed and V** = 
{p G V{T) £ ; 7(p) = 7**} be the set of optimal strategies. Let v ei (de2) be the conditional distribution 
of u>2 given u>x = e±, so that v 2 (dex,de 2 ) — i/x(dei)v ei (de 2 ). Note that conditional distributions are 
well-defined since £ is assumed to be a Polish space. From the previous section, the set of optimal 
strategies without sensing is denoted by V* . It depends implicitly on the environment distribution v\ 
and we write V*{i>\) to emphasize this dependence. Suppose assumptions of Proposition [2] hold. Then 
optimal strategies with sensing are related with optimal strategies without sensing in the following way: 

Proposition 8. A strategy p is optimal if and only if 

Pei G V*(i'e 1 ) vx(dei) almost everywhere. 

If conditions (Ci) — (C2) hold, then optimal strategies exists and form a closed convex set i.e. V** is 
nonempty closed and convex. 

If the family M = {e 1— > m t e ;i G T} is linearly independent, there is a unique optimal strategy p** . 

Proof. Using the explicit formula for 'yip) given in Proposition [7] and conditional probabilities, we 
compute 

lip) = / log (m Pei) e 2 ) v 2 (de 1 ,de 2 ) 



u\{de\) / log (m Pci! e 2 ) v ei {de 2 ) 

7(Pei,fei)i / l(rfei) 



£ 



whith 7(peu ^ei) — log ( m p ei ,e 2 ) v e 1 {de2) the growth rate associated with strategy p ei in environment 
i/ ei . Hence 7(75) is optimal if we choose p ei so that j{p ei ,v ei ) is maximal, i.e. p ei G T , *{v ei ) almost 
surely. The properties of V** and the uniqueness follow from the similar results for V* (cf Proposition 
0). □ 



Then, the characterization of optimal strategies in the no-sensing case given in Proposition directly 
extends to strategies with sensing as follows: 

Proposition 9. A strategy with sensing p G V{T) £ is optimal if and only if 

I t,e 2 ^ [de 2 ) < 1, for all t G T and V\(dex) — almost everywhere. 

J£ m p ci ,e 2 

Proof. The characterization follows directly from Proposition [3] and Proposition [5] together. 

An interesting corollary states that no gain has to be expected from sensing mechanisms if the 
environment has some independence property. More precisely, 

Corollary 2. Suppose that u)\ and lu 2 are independent (i.e. v 2 = v\® v\), then the optimal growth rate 
with or without sensing are equal, i.e. 7* = 7**. 
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Proof. In this product case, the conditional distribution are trivial, i.e. v ei = v\ almost surely. Hence 
an optimal strategy is such that p* e * <E V*(y\) and hence — 7*. Integrating with respect to 

vi{dex), we obtain 



Note that this result is rather intuitive: the sensing mechanism gives to the individual some infor- 
mation about the current environment state, but from the independence property, this is not useful for 
inference to the future environment state; hence the information is useless to decide which traits will be 
well-fitted in the next environment. 

4.3 Example: Finite dimensional case continued 

The results for optimal strategies without sensing developed in Section 13.41 together with Proposition 
[3] allow us to easily deduce the following properties for optimal strategies with sensing in the finite 
dimensional case. 

Proposition 10. Suppose that spamVl has dimension d and that conditions (Ci) — (C3) and (H) hold. 

Then there exists a unique optimal strategy with sensing p** £ V(T) S such that 



where p*(v e ) is the optimal strategy without sensing from associated when the environment has marginal 
distribution v e . 

Recall furthermore from Proposition [5] that p*(v e ) is a discrete probability measure on T with at 
most d extremal types (that may depend on e). See Section [1.1. 21 for more examples. 

4.4 Exemple: Haccou and Iwasa's example continued 

This is the sequel of subsection 3.5, the example by Haccau and Iwasa. Recall that the environment is 
given by a Gaussian stationary ergodic sequence to = (w n )n>o with stationary distribution v\ = J\f((i, a 2 ). 
Let p G (—1, 1) be the pair correlation p = corr(wo,wi). The case p = corresponds to independent 
environments, i.e. v?, = V\ (8 v\. Otherwise dependence holds and standard Gaussian computations give 
the conditional distribution v ex = Af(p,+ p(e\ — /x), (1 — p 2 )c|). A particular realization of such a sequence 
is the Ornstein-Uhlenbeck sequence defined by 



for i.i.d. standard normal innovations (^ n )n>o (Gaussian white noise). 

We have seen in section 3.5 that the optimal strategy without sensing is Gaussian (possibly degener- 
ated) and we have given explicit formulas for the parameters ([3]) and for the corresponding growth rate 
(H|). According to Proposition [SI we deduce the optimal strategy when sensing is allowed: 

Proposition 11. There exists a unique optimal strategy with sensing, which is denoted by p** and 
satisfies v\(de)-a.e. 




This proves the result. 



□ 



p**(e)=p*(v e ) 





if (1 
if (1 



p>2 < a 2 

p 2 )al > <j\ 



(5) 



The corresponding optimal growth rate is given by 




logC-ilog(2w 1 2 )-^^ 
logC-ilog(24l-p 2 )a 2 2 )-i 



if (1 
if (1 



p 2 )al < a\ 
P 2 )4 > °\ 
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Finally, we can evaluate the relative gain of strategies with sensing over strategies without sensing: it 
gives an indication of the benefit that can be expected from sensing mechanisms. Let \ = a \l°~\- Then 

{ \p 2 X if X < 1 

7**-7* = < |logx-i(l-p 2 )x+| if 1 <X< (I-P 2 )- 1 ■ 
{ -±log(l-p 2 ) if X^il-P 2 )- 1 

It is worth noting that this is an increasing function of the square correlation p 2 : this indicates that the 
more correlated the random environment is, the more useful sensing mechanisms are. The intuitive idea 
is that higher correlations allows for more accurate prevision for the next environment and hence for a 
better fitted offspring trait distribution in the environment to come. 



5 The hereditary case 

In the hereditary case, the trait distribution 7r t e £ 'P(T) might depends on the trait of the parents. 
This dependency makes the study of the Lyapounov exponent 7 = 7(77) much more difficult because no 
reduction to a simple branching process in random environment is available. We have no explicit formula 
for 7(77) in this case and determining the optimal strategy 7r* and the optimal growth rate 7(71*) might 
be very challenging. 

Nevertheless, we propose an interesting representation of the "finite time" growth rate 

7„(w,7ro) = rT 1 logE W:Tro [|Z„|] 

in environment u) and initial population consisting of a single individual with random trait with distri- 
bution ttq. The representation is in term of a functional of the T-valued Markov chain T = (T„) n >o in 
environment u) such that: 

\ P^CZfce ■\T = t ,->- ,T fe _! =t fe _i) =7T 4fc _ 1>£l)fc _ 1 (-) , l<k<n ■ w 

The Markov chain T n in random environment lu is time-heterogeneous because the transitions depend on 
time n through the value of the environment uj n . However, in the no-sensing case when the transitions 
Tt t , e = TTt do not depend on e, the Markov chain T is time-homogeneous. The result is the following: 

Proposition 12. The finite time growth rate in environment to and initial population consisting in a 
single individual with trait distributed according to ttq is given by 

7„(w,7r ) = n -1 logE W!iro [M n (T,a;)] 

with 

n-1 

M n (T,uj) = Y[ m Tk ^ k - 

k=0 

For the sake of clarity and conciseness, this proposition will be proved together with Theorem [2] 
below. To our best knowledge, there is no simple way to deal with the asymptotics behaviour of j n and 
this issue might be challenging. We provide a lower bound for the growth rate 7 n (w,7To) that might be 
more tractable. The lower bound for 7„(cj,7ro) is obtained using Jensen's inequality (with the concave 
function log): 

7 „(c;,7r ) = n-MogE^jM^T,^)] > n^E^ [log M n (T, u)]. 
The advantage here is that the lower bound 

n-1 

n _1 E Wi7ro [logAf„(T,u;)] = E Wi ^ |logmr fc ,w*] 

fc=0 

can be written as an additive functional of the environment to and we can then use the ergodic theorem to 
control the convergence. Note that this technique can be refined using changes of measures. Let jrt, e an y 
kernel family and denote by T the T-valued Markov chain in environment to starting from distribution ttq 
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and with transitions given by jt t ,e (given by equations similar to ©)■ We suppose that n t: e is absolutely 
continuous with respect to Tr t , e , i.e. ir t , e (dt') — ft,e{i')^t,e{dt') where ft tC stands for the density of 7Tt je 
with respect to TTt,e- Then, using changes of measures, we have 

ui^o [Mn(T, Co)] 

with 

n-1 
k=0 

Using this, the lower bound becomes 

n-1 

7„(w,7r ) = ra _1 E w , ff JlogM„(T,w)] > n" 1 ^E W: „ log (m,f k , Uh ff k ,u> k C^ k +^) ' 

k=0 

6 Typical genealogies: a mean field approach 

6.1 Convergence of the typical genealogy in the infinite population limit 

As explained in Baake and Georgii [U [5] , the evolution of a branching population can be studied from 
two possible perspectives: either forward or backward in time. So far, we have focused on the first point 
of view and mainly studied the growth rate of the population after a large numbers of generations. By 
way of contrast, the backwards or retrospective aspect of the population concerns the lineages extending 
back into past from the presently living individuals and asks for the characteristics of the ancestors 
along such lineages. We now turn to this second perspective and wonder what is the typical lineage or 
genealogy (backward in time) of an individual chosen at random in the n-th generation. 

Some definitions are needed here. The right formalism to keep track of the genealogy is the formalism 
of labeled rooted trees and forests, where the trees stands for the descendence of each ancestor represented 
by a root, and labels keep track of the traits of the individuals. However we keep this formalism to its 
minimum. Let Q n denote the population at the n-th generation. To each individual g £ Q n , we associate 
its lineage or genealogy £(g) = (io, • • • , t n ) 6 7" n+1 with the interpretation that t n is the trait of <?, and 
tk the trait of his ancestor in the fc-th generation Qk, < k < n — 1. The typical genealogy is defined 
as the genealogy of an individual chosen at random in the n-th generation. This obviously requires the 
n-generation to be non empty, in which case we adopt the convention that the typical genealogy is 0. 
The distribution of the typical genealogy is given by 

5$ if Q n =% 

This is a random measure on 7~ n+1 U {0}. We denote by Pjv.7r ,^ the probability measure corresponding 
to a population evolving in environment w, starting a time from N individuals with traits i.i.d. with 
distribution t:q. In the following, we focus on the typical genealogy in the infinite population limit 
N — > oo. Recall the definition of the Markov chain T — (T„)„>o given by equation ©. We have the 
following mean field result: 

Theorem 2. Under the probability P_/v l7 r .w; the typical genealogy distribution ir n almost surely weakly 
converges as N — > co to the distribution TT n>7T0jUJ defined by 

* / ^ _ E^ [M»(I>)l (To ,..., Tti) s A ] +1 
E Wj7ro [M n (T,uj)\ 



Recall from Proposition 1121 that E Wi7ro [M n (T, u>)] = exp(n7„(7To, ui)) is the mean number of individuals 
in the n-th generation of a population evolving in environment u) and starting from a single individual 
with random trait with distribution ttq. 
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Proof of Theorem^ and Proposition [7H Let A = Ao x • • • x A n be a product subset of T n+1 . The 
number of individuals in the n-th generation with genealogy in A is 

M n {A)= S m (A) 

and A/"„(T n+1 ) denotes the total number of individuals in Q n . We can see Af n as a non-normalized 
measure and 7r n is the probability measure associated with by the relation 

n ' n = N n (T n + 1 ) 1{Arn(Tn+1)>0}Afn + 1 {^( r " +1 )=°} (5 0- 

From the branching property the distribution of Af n (A) under Pjv.7r ,w is equal to the sum of iV inde- 
pendent copies J2iLiNn\A) under Pf„. u . As a consequence of the weak law of large numbers, the 
distribution of jjAf n (A) under Pjv> 0iU , weakly converge to Ei. 7r0iW (A/'„(A)). The results also holds for 
A = T n+1 and taking the quotient, we see that under Pjv,7r ,o>! K n (A) weakly converge to 

" 71,7To ,UJ \^*-J 



E w (JV„(T»+i)) 
provided the denominator is non zero. Note that 

7„(w )7 r ) = n- 1 logE 1 ^ ^(AA„(r"+ 1 )). 
The theorem and the proposition (letting A = T n ) are then a consequence of 

/n—l 
A k=0 
= E W)Wo [M„(T,o;)l( r0j ... iT „) eyl ]. 

This last relation is proven by induction: individuals in the n-th generation with genealogy in A are the 
offsprings with traits in A n of individuals in the n — 1 generation with genealogy in Aq x • • • x A n —\. 
This yields 



^(M.IA)) = / E lt ^ iW (A/;_i(i4o 



x • • • x A„_ 2 x dt n -i))m tn _ 1 (di„). 



For n = 0, Ei iWO)(i; (A/"o(Ao)) = 7To(A). It remains to note that Ei ! , ro . w (A/', l (T n+1 )) ^ and this implies 
that the mass of vanishes in the limit. □ 



6.2 The typical genealogy in the non-hereditary case 

When the traits are non-hereditary, the trait distributions Tt t , e do not depend on t and the mean field 
typical genealogy distribution TT„ jUtVo has a very simple form. The following Proposition is given in the 
context of a population with a sensing mechanism 7r = ir((p e ) e< =£). The no-sensing case corresponds to 
the particular case when p e = p. 

Proposition 13. In the non-hereditary case, the typical genealogy distribution 7r njWj7ro is the product 
measure on T n+1 defined by 

K n ,Lj,-n (dto, • • • , dt n ) — <X>" =0 7ri,w, (dU) 

where 

< Tti,ui{dU) = m Pui-AdU), l<i<n-l, . 

" L PLO i _ 1 

, Ttn,oj n (dt n ) = Pu n -i\dt n ) 
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Proof. In the non-hereditary case, the Markov chain in random environment u> defined by equation © 
is simple because conditionally on the environment ui, the random variables To, ■ ■ ■ ,T n are independent 
with To distributed as ttq and, for 1 < i < n, Ti distributed according to Let A = Aq x • • • x A n C 

T n+1 . Using independence and Theorem [2 we compute: 

[Af n (T,w)l (Ta) .„ )Tw)6jl ] 



E W)7ro [M„(T,a;)] 

[ m T ,uo^T £A ] X • • • X 

K,u ] X • • • X \m r p n _ 1 

,Wri- 1 J 

This proves the independence property and gives the marginal distributions: 

MA<) = \ o[mT r lT V i] > !<<<n-l. 



□ 



The interpretation of the above proposition is the following: the mean field typical genealogy in en- 
vironment oj consists in independent traits Tq, ■ ■ ■ ,T n where 

- the distribution of Tq is a biased version of iro(dto) with bias function equal to m to , Uo the mean number 
of offspring's of an individual of type to, 

- the distribution of Ti is a biased version of p U i i (dti) with bias function equal to m tut0i , 

- the distribution of T n is p Ui (dti) (there is no bias because the offspring of the last generation Q n is not 
involved since the population is considered until time n only). 



6.3 The typical genealogy in a hereditary case: Haccou and Isawa's example 
continued 

Recall that T = £ — K, u> is a Gaussian stationary ergodic sequence with stationary distribution jV(/z, erf) 
and m t e is given by 

{t-ef 



c 



m t ,, 



, exp - 



2af 



It remains to precise the trait distributions 7Tt je and ttq. To make the model explicitly solvable, we require 
the Markov chain T = (Tq, ■ ■ ■ , T n ) in environment u> to be multivariate Gaussian. This impose that the 
transition 7r tje are of the form 7r tje = Af(a e t + [3 e , 9 e ) with 

E[Ti \u) = e, T = t] = a e + (3 e t and Var[Tj \u) = e,T = t] = 9 2 e . 

Let 7To = W(moj Sq) be the initial distribution. Alternatively, we have the representation in environment 

T = Mo + sqN , 

T k+ i = a Uk + f3 Uh T k + 6 Uh N k+ i 0<k<n-l, 
with No, ■ ■ ■ , N n independent standard normal variable. We introduce the (n + 1) x 1 vectors 

/ Mo \ 



to 



T = 



( To \ 



N = 



and the (n + 1) x (n + 1) matrices 



a 



( *° \ 



A,,, — 



/ 
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With these notations, the recursive relation turns into 

T = A u + C W T + S U N 

and this yields 

T = (Id-C u) )-\A w +S UJ N). 
We deduce the mean /j, and covariance matrix E for the Gaussian vector T 

H=(Id-C u) )- x A w and V = {Id-C u )- 1 S u {Id-Cfj)- 1 . 

Suppose that S u is invertible, then T has density 

f T {t) = (2^)-(" +1 )/ 2 det(E)" 1 / 2 exp _ ^)' E -i( t _ . 

Denote by T = (To, • • • , T n ) a random vector with distribution TT n ,ui,Tr - According to Theorem^ T is a 
biased version of T and has a density given by 

f f (t) = e- n ^^M n (t,u;)f T (t) 

= e-^»^' 7ro )C" ( 7r"(27r)-"- 1 / 2 det(S)- 1 / 2 e X p ( -ht - M)'E _1 (t - M) - A £>i - U) 2 J . 

\ CTl i=0 / 

Introducing J ni i the diagonal matrix J nj i = (^o<i=j<n-i)o<i,j<n an d V n>u) the column vector V n . u = 
(u)q, • • • , w n _i, 0)', the last exponential factor rewrites 

exp (-^'(s -1 + ^rv„ : i)i + (irv + ^rv„, j'< - V - ^r 2 K,«^., w ) • 

We recognize that ff is a multivariate Gaussian density with mean A and covariance matrix E of the 
form 

/ f (t) = (2^)-(» +1 )/ 2 det(S)- 1 / 2 exp (-±{t - {i)'t~ x {t - A)) • 

Identifying both expressions, we obtain after simplification 

S = (S^+aj-Vn.i)" 1 , (7) 
A = E (E~V + ^ 2 K, W ) = (E- 1 + a^J^y 1 (E~V + ^fXw) , (8) 
7n (w, 7T ) = log -=£ J- logdet(I n+ i + af 2 J„.iE) + i- (A'E^A - /i'Ep - (Tf 2 ^ ^Vn.S) .(9) 



These computations prove the following result: 

Proposition 14. In the Haccou and Isawa model with Markov Gaussian environment, the typical geneal- 
ogy distribution Tt n ,u,n is the Gaussian distribution on M™ + with mean A given by © and covariance 
matrix E given by ([8]). Furthermore, the finite time growth rate 7«(w, ttq) is given by ([9]). 
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